AITopics | epp score

Collaborating Authors

epp score

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Rethinking benchmark systems for machine learning

AIHubSep-3-2020, 08:56:51 GMT

Common methods applied in the evaluation of model performance share several limitations. There are many approaches to verify whether a new algorithm improves the performance compared to the previous state-of-the-art algorithms. The majority of them are testing procedures. In his paper Statistical Comparisons of Classifiers over Multiple Data Sets, Janez Demšar reviewed commonly used practices and pointed out the vast amount of problems with them. He analyzed papers from five International Conferences on Machine Learning (1999-2003) that compared at least two classification models.

algorithm, artificial intelligence, machine learning, (17 more...)

AIHub

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Interpretable Meta-Measure for Model Performance

Gosiewska, Alicja, Woznica, Katarzyna, Biecek, Przemyslaw

arXiv.org Machine LearningJun-2-2020

Measures for evaluation of model performance play an important role in Machine Learning. However, the most common performance measures share several limitations. The difference in performance for two models has no probabilistic interpretation and there is no reference point to indicate whether they represent a significant improvement. What is more, it makes no sense to compare such differences between data sets. In this article, we introduce a new meta-measure for performance assessment named Elo-based Predictive Power (EPP). The differences in EPP scores have probabilistic interpretation and can be directly compared between data sets. We prove the mathematical properties of EPP and support them with empirical results of a large scale benchmark on 30 classification data sets. Finally, we show applications of EPP to the selected meta-learning problems and challenges beyond ML benchmarks.

artificial intelligence, epp score, machine learning, (16 more...)

arXiv.org Machine Learning

2006.02293

Country: Europe > Poland > Masovia Province > Warsaw (0.05)

Genre: Research Report > Experimental Study (0.69)

Industry:

Education (0.54)
Leisure & Entertainment > Games (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

EPP: interpretable score of model predictive power

Gosiewska, Alicja, Bakala, Mateusz, Woznica, Katarzyna, Zwolinski, Maciej, Biecek, Przemyslaw

arXiv.org Machine LearningAug-24-2019

The most important part of model selection and hyperparameter tuning is the evaluation of model performance. The most popular measures, such as AUC, F1, ACC for binary classification, or RMSE, MAD for regression, or cross-entropy for multilabel classification share two common weaknesses. First is, that they are not on an interval scale. It means that the difference in performance for the two models has no direct interpretation. It makes no sense to compare such differences between datasets. Second is, that for k-fold cross-validation, the model performance is in most cases calculated as an average performance from particular folds, which neglects the information how stable is the performance for different folds. In this talk, we introduce a new EPP rating system for predictive models. We also demonstrate numerous advantages for this system, First, differences in EPP scores have probabilistic interpretation. Based on it we can assess the probability that one model will achieve better performance than another. Second, EPP scores can be directly compared between datasets. Third, they can be used for navigated hyperparameter tuning and model selection. Forth, we can create embeddings for datasets based on EPP scores.

artificial intelligence, epp score, machine learning, (15 more...)

arXiv.org Machine Learning

1908.09213

Genre: Research Report (1.00)

Industry: Leisure & Entertainment > Games (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.69)

Add feedback